Fitness Function Comparison for GA-Based Feature Construction

نویسندگان

  • Leila Shila Shafti
  • Eduardo Pérez
چکیده

When primitive data representation yields attribute interactions, learning requires feature construction. MFE2/GA, a GA-based feature construction has been shown to learn more accurately than others when there exist several complex attribute interactions. A new fitness function, based on the principle of Minimum Description Length (MDL), is proposed and implemented as part of the MFE3/GA system. Since the individuals of the GA population are collections of new features constructed to change the representation of data, an MDL-based fitness considers not only the part of data left unexplained by the constructed features (errors), but also the complexity of the constructed features as a new representation (theory). An empirical study shows the advantage of the new fitness over other fitness not based on MDL, and both are compared to the performance baselines provided by relevant systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Multi-Mode Resource-Constrained Optimization of Time-Cost Trade-off Problems in Project Scheduling Using a Genetic Algorithm

In this paper, we present a genetic algorithm (GA) for optimization of a multi-mode resource constrained time cost trade off (MRCTCT) problem. The proposed GA, each activity has several operational modes and each mode identifies a possible executive time and cost of the activity. Beyond earlier studies on time-cost trade-off problem, in MRCTCT problem, resource requirements of each execution mo...

متن کامل

An application of a GA with Markov network surrogate to feature selection

Surrogate models of fitness have been presented as a way of reducing the number of fitness evaluations required by evolutionary algorithms. This is of particular interest with expensive fitness functions where the time taken for building the model is outweighed by the saving of using fewer function evaluations. In this paper, we show how a Markov network model can be used as a surrogate fitness...

متن کامل

Genetic algorithm based feature selection for target detection in SAR images

A genetic algorithm (GA) approach is presented to select a set of features to discriminate the targets from the natural clutter false alarms in SAR images. Four stages of an automatic target detection system are developed: the rough target detection, feature extraction from the potential target regions, GA based feature selection and the final Bayesian classification. A new fitness function bas...

متن کامل

Genetic Algorithm Based Feature Selection and Unbiased Protocol for Classification of Breast Cancer Datasets

Feature selection is an essential pre-requisite before classification and diagnosis of a cancer disease. Several studies have been done using Genetic Algorithm (GA) and machine learning techniques that aim to select the relevant features by wrapping the classification algorithm as GA fitness function. However, the performance of GA based feature selection is always focusing on a same datasets t...

متن کامل

Comparison of SGA and RGA based Clustering Algorithm for Pattern Recognition

-In this paper Genetic Algorithm based clustering Algorithm has been studied for pattern recognition. The searching capability of genetic algorithms is exploited in order to search for appropriate/optimal cluster as well as cluster’s center in the feature space such that inter-cluster distance (Homogeneity) and intra-cluster distances (Separation) are optimized. We use H-S ratio for computation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007